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ABSTRACT 

Using a hydrodynamics plus Y-body simulation of galaxy cluster formation within a large volume 
and mock Chandra X-ray observations, we study the form and evolution of the intrinsic scatter about 
the best-fit X-ray temperature-mass relation for clusters. We investigate the physical origin of the 
scatter by correlating it with quantities that are closely related to clusters' formation and merging 
histories. We also examine the distribution of the scatter for merging and nonmerging populations, 
identified using halo merger trees derived from the simulation as well as X-ray substructure measures. 
We find a strong correlation between the scatter in the M — Tx relation and the halo concentration, 
in the sense that more concentrated clusters tend to be cooler than clusters with similar masses. No 
bias is found between the merging and relaxed clusters, but merging clusters generally have greater 
scatter, which is related to the properties of the distribution of halo concentrations. We also detect a 
signature of non-lognormality in the distribution of scatter for our simulated clusters both at z = 
and at z = 1. A detailed comparison of merging clusters identified by substructure measures and 
by halo merger trees is given in the discussion. We conclude that, when cooling-related effects are 
neglected, the variation in halo concentrations is a more important factor for driving the intrinsic 
scatter in the M — Tx relation, while departures from hydrostatic equilibrium due to cluster mergers 
have a minor effect. 

Subject headings: galaxies: clusters: general — hydrodynamics — intergalactic medium — X-rays: 
galaxies 



1. INTRODUCTION 

Galaxy clusters are potentially valuable cosmological 
probes because of their unique position in the hierarchy 
of structure formation. They are the largest gravitation- 
ally bound objects, having just separated from the cos- 
mic expansion and collapsed from density fluctuations 
in the past few billion years. Therefore, statistical mea- 
sures of clusters, such as their mass distribution as a 
function of rcdshift, are sensitive to the cosmic matter 
density parameter f2 m , the dark energy density param- 
eter fide, the normalization of the primordial fluctua- 
tion spectrum ogi and the dark en ergy equation of state 
parameter uu (jHaiman et al.ll20"05T ). Future cluster sur- 
veys will yield cosmological parameter constraints that 
are complementary to those from upcoming microwave 
background probes (e.g. Planck) and Type la supernova 
observations. 

However, clusters present us with a dilemma: their 
masses are well-predicted by numerical simulations, but 
in the real world, 80-85% of their mass is in the form 
of invisible dark matter. In order to make contact 
with observations, observational mass proxies are needed. 
Fortunately, cluster masses correlate with many ob- 
servable quantities, such as X-ray temperature Tx, X- 
ray luminosity Lx, optical richness, infrared luminos- 
ity, and the Sunyaev-Zerdovich effect dMohr et al.l 1 19991 : 
Lin et all [ 2003: Po pesso et all 120051: lHansen et atlboosl : 
Stanek et a l. 2006). These mass-observable relations in- 
dicate that clusters are fairly regular and close to equilib- 
rium, despite having dynamical timescales of order 1/10 
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the age of the universe. 

For clusters to provide meaningful constraints on 
the cosmological parameters, the systematic errors in 
mass estimates based on these relations must be well- 
understood. For example, masses determined under the 
assumption of hydrostatic equilibrium in general under- 
estimate the spheri cal overdensity mass M200 by ~ 20% 
(|Nagai et a l. 2007b). The X-ray temperature determined 
through spectral fitting is also known to bias low with 
respect to the emission- weighte d temperature com monly 
used in numerical simulations (Ras ia et al.l 12005). The 
discrepancy in the normalization of the M-Tx relation 
between observations and numerical simulations has been 
alleviated only recently by taking these two effects into 
account. This emphasizes the importance of using mock- 
observation tools to make direct comparisons between re- 
sults from numerical simulations and observational data. 

The origin and distribution of scatter in these rela- 
tions are also important. Due to the exponential shape 
of the mass function, scatter in the M-X relation (for 
observable X) boosts the number density of clusters ob- 
served in logarithmic bins of X, as the overall number of 
lower-mass clusters scattering to higher values of X far 
exceeds the number of high-mass clusters scattering in 
the opposite direction. Underestimating t his scatter can 
lead t o an overestimate of erg, Ior instance (Randall et al. 
2002). Attempts have been made to reduce the ob- 
served scatter to get better constraints on cluster masses. 
Such attempts include the use of core-excised quanti- 
ties to reduce the large scatter i n the Lx~Tx relation 
due to the effects of cool cores (|Allen fc Fabianl [1998; 
•M\ tttfw».&tual. 2006) and the invention of the X-ray coun- 
terpart of the Compton y-parameter, Y x = M sas Tx, to 
obtain a very tight M-Yx correlation (|Kravtsov et al.l 
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2006). These successful examples illustrate the possibil- 
ity of obtaining better mass estimates if our knowledge of 
the physical origin of scatter is impr oved. It is also pos- 
sible to self-calibrate clus t er surveys (iLevine et alj|2002t 
iMaiumdar fc Mohrll200i iLima fe Hul l2004f ). fitting the 
mass-observable relation as an unknown together with 
the cosmological parameters. However, this technique 
requires assumptions about the functional form and the 
mass and redshift dependence of the scatter, and errors 
in these assumptions can lead to misinte rpretation of the 
obtained constraints (|Lima fc Hu 20051). 

In this paper we begin a systematic study of the in- 
fluence of internal cluster physics on the scatter in clus- 
ter mass-observable relations. We focus initially on X- 
ray observables, as they are less directly dependent on 
the uncertain details of galaxy formation than, for ex- 
ample, optical observables. Our aim in this paper is 
to examine the effects of mergers and dynamical state 
in isolation from effects due to radiative cooling, feed- 
back due to stars and black holes, and diffusive trans- 
port. Future papers will examine these other effects sep- 
arately. Accordingly, the simulation described here in- 
cludes only dark matter and gasdynamics. While we ex- 
amine substructure-based measures of dynamical state to 
make contact with previous work, we extend this work 
by considering direct measurements of dynamical state 
that may not be observable but that can give us physi- 
cal insight into the origin and form of the scatter. For 
example, we use halo merger histories derived from halo 
catalogs created every 100/i -1 Myr to identify which clus- 
ters are merging at any given epoch. § [2] describes our 
numerical methods and simulation parameters. In §[3] we 
describe our merger tree and virial analysis procedures, 
our method for generating simulated X-ray observations, 
and the substructure measures we employ. We present 
our results in § |4] and discuss them in § O Finally, we 
summarize our conclusions in § |6l 

Throughout this paper we have taken the Hubble con- 
stant to be H Q = lOO/i km s _1 Mpc" 1 , with h = 0.708. 
When quoting masses or radii defined using an overden- 
sity criterion (e.g., Af200: ^200)7 we refer to overdensitics 
relative to the critical density at the relevant epoch. 

2. COSMOLOGICAL SIMULATION 

2.1. Numerical methods 

The simulation described here was performed using 
FLASH, an Eulerian hydrodynamics plus TV-body code 
originally developed for simul ations of Type la super- 
novae and related phenomena (|Frvxell et al.| [2000). The 
flexibility of FLASH'S application framework has en- 
abled it to be applied to a wide range of problems. 
In the process it ha s been extensively validated, both 
for hydrodyn amical (ICalder et al.l [20 02) and cosmolog- 
ical TV-body (|Heitmann et al.ll2005l 120081) applications. 
We used version 2.4 of FLASH together with the local 
transform-ba sed multigrid Poisson solver described by 
iRickerl ()2008l ). The Euler equations describing the behav- 
ior of the intracluster medium (ICM) were solved using 
the Piecewise-Parabolic Method (PPM); extensive de- 
tails of the FLASH implementation of PPM are given by 
iFrvxell etT al. (2000). The iV-body component describing 
the behavior of the dark matter was handled using the 
particle-mesh technique with cloud-in-cell interpolation. 
Because we are concerned in this paper only with the 



effect of gravity-driven variations in dynamical state on 
mass-observable relations, the calculation described here 
did not employ radiative cooling or feedback due to star 
formation or active galaxies. 

2.2. Simulation details 

The results presented here are based on a FLASH 
simulation of structure formation in the ACDM cosmol- 
ogy within a 3D cubical volume spanning 256h~ 1 Mpc. 
Initial conditions were gener ated for a starting red- 
shift z of 66 using GRAFIC (|Bertschingeri [2001 with 
an initial power spectrum generated using CMBFAST 
(jSeliak fe Z aldarriaga 1996). The cosmological param- 
eter values used were cho sen to be consistent with the 
third-year WMAP results (jSpergel et al.ll2007f) : present- 
day matter density parameter f2 m o = 0.262, present-day 
baryonic density parameter f^o = 0.0437, present-day 
cosmological constant density parameter J7ao = 0.738, 
and matter power spectrum normalization ag, = 0.74. 
The simulation contains 1024 3 dark matter particles with 
a particle mass m p = 9.2 x 10 8 /i _1 M Q . The mesh used 
for the gasdynamics and potential solution was fully re- 
fined to 1024 3 zones, which corresponds to a zone spac- 
ing of 250h~ 1 kpc. Considering the effect of resolution 
on the computed abundances of halos of different mass 
(|Lukic et al.ll2007T ). with these parameters we are able 
to capture all halos containing more than 3150 particles 
(i.e. total mass 2.9 x 10 12 Ii~ 1 Mq) and 1150 particles (i.e. 
1.1 x lO 12 ^" 1 ^/©) at z = and z = l, respectively. The 
halos are identified using the friends-of-friends (FOF) al- 
gorithm. The overdensity mass and radius, Ma and Ra, 
are then found by growing spheres around each FOF cen- 
ter until the averaged total density is A times the critical 
density of the universe. 

The simulation was carried out using 800 processors of 
the Cray XT4 system at Oak Ridge National Laboratory, 
requiring a total of 16,500 CPU-hours. Gas and particle 
snapshots were written to disk every 100/i _1 Myr begin- 
ning at z = 2, yielding a total of 117 snapshots containing 
15 TB of data. 

3. ANALYSIS OF THE SIMULATION 
3.1. Merger tree analysis 

In order to directly quantify the dynamical state of 
clusters without relying on morphology, we generate 
merger trees for each cluster in our simulation and find 
the time since last merger. Here we summarize how the 
merger trees are extracted. 

First, our simulation generates output files that con- 
tain particle tags and positions every 100/i _1 Myr be- 
tween z — 2 and z — 0. We run a FOF halo finder 
with linking length parameter b = 0.2 on the particle 
positions to find all the groups containing more than 10 
particles. Although some of the very small groups are 
under our halo completeness limit, they have little effect 
on our results because we only look at minor or major 
mergers for which the mass of the smaller object is above 
our completeness limit. Thus all mergers for the objects 
we study are being counted. Between successive outputs 
at times t = t n , we find the progenitors at time 
for all the halos at t n by tracing the particle tags, which 
are uniquely assigned to each particle in the beginning 
of the simulation. A halo A is identified as a progenitor 
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of another halo B if A contains at least one particle that 
is also in B. For each halo we record the masses of its 
progenitors, their contributed masses, and the number 
of unbound particles. Then the merger trees are con- 
structed by linking all the progenitors identified in the 
previous outputs for halos above our halo completeness 
limit at z = 0. Deriving the mass accretion histories is 
straightforwardly accomplished by following the mass of 
the most massive progenitor back in time. 

To find the time since last merger for any given halo, 
we need to define what a 'merger' is. There are many 
different ways to define mergers in cosmological simula- 
tions. In our analysis we adopt two definitions: the mass- 
jump definition, in which a merger is present if there is a 
mass jump in the halo's assembly history; and the mass- 
ratio definition, which identifies a merger if the ratio 
of contributed masses from the first- an d second-ranked 
proge nitors is less than a certain value (|Cohn fc White! 
2005). To study the variations in cluster observables in- 
duced by different types of mergers, we use 1.2 and 1.33 
as thresholds for the mass-jump definition and 10:1, 5:1, 
and 3:1 in the mass-ratio definition. For each of the five 
criteria, the time since last merger is found for all clusters 
at z — and z = 1. 

In the later discussions we refer by 'merging clusters' at 
a given lookback time to those identified by at least one of 
the five merger diagnostics in the preceding 3 Gyrs, the 
typical time for clusters to re turn to virial equilibrium 
within R 500 (jPoole et al.ll2006l ). The mergers are 'major' 
if the mass jump is larger than 1.2 or if the mass ratio is 
less than 5:1; 'minor' mergers, on the other hand, have 
mass ratios between 10:1 and 5:1. 

3.2. Simulated X-ray observations 

The observational X-ray temperature of the ICM, Tx, 
usually refers to the spectroscopic temperature, obtained 
by fitting a single-temperature model to the integrated 
cluster spectrum. Using simulation-based temperature 
proxies, such as emission- wei ghted temperature, has been 
shown to overest imate Tx (Mathic sen fc Evrardl 120011 : 
iRasia et"al"1l2005D . To directly compare simulated clus- 
ter properties with observations, we create mock Chan- 
dra images for our simulated clusters using the following 
procedure. 

First, energy-dependent surface brightness maps are 
constructed by projecting X-ray emission from all the 
gas cells associated with each cluster along each of three 
orthogonal axes. The energy dependence is stored as 
the third dimension of the map between 0.04 and 10 
keV with spacing AE = 0.0498 keV. With the gas den- 
sity p and temperature T of each cell in each simula- 
tion output and an assumed metallicity Z, the X-ray 
emissivity, €e = p 2 Ae(T,Z,z), is c omputed using the 
single-temperature MEKAL model dMewe et al.l 119851: 
iKaastra fc Mewdl 19931: iLiedahl et al.lll995f) implemented 
in the utility XSPEC (|Arnaudlll99lfh 

With these energy-dependent surface brightness maps 
in hand for each cluster, we then use MARX 4 to simu- 
late Chandra X-ray observations. Given the position and 
spectrum of a source, MARX can perform ray generation, 
apply built-in models for Chandra's aspect motion and 
mirror and detector responses, and output photon event 

4 http://space.mit.edu/ASC/MARX 



files in FITS format for analysis with standard observa- 
tional tools. The user-defined source model supported 
by MARX allows us to generate light rays from astro- 
nomical sources with arbitrary shape and spectrum. We 
assign the positions and energies of photons based on the 
probability distribution defined by our 3D surface bright- 
ness maps. Each cluster is "observed" with Chandra's 
CCD chip ACIS-S using an exposure time that ensures 
2 million photons are collected, longer than typical deep 
observations. The particular long exposure time is cho- 
sen to minimize observational uncertainties since we are 
interested in the intrinsic scatter in cluster observables. 
The photon event files thus produced are processed with 
CIAO to extract the spectra within apertures of size 
-R500 centered on each cluster's surface brightness peak. 
The spectroscopic temperatures Tx are obtained by us- 
ing XSPEC to fit the spectra in the rang e 0.5 to 10 keV, 
the r ange often used by observers (e.g., Vikhlini n et al.1 
2006). We tested the above procedure using a set of 
isothermal clusters with /3-model density profiles, vary- 
ing the input temperature from 0.5 to 5 keV and using 
two different metallicities, Z — and Z = 0.3Z Q . We 
have verified that for our chosen exposure time, the input 
temperatures are recovered within la errors in all cases. 
For simplicity we assume zero metallicity in the following 
analysis. A detailed description and verification tests of 
th e same X-ray s imula tor can be found in the appendix 
of IZuhone et~atl (|2008l ). 

The left panel in Figure [T] shows Tx versus the 
emission-weighted temperature T ew measured within 
-R500 for clusters with M500 above 2 x 10 13 Mq at z = 0. 
We find that T ew is on average biased high with respect 
to T x with a fractional bias of (T e ]ll - T x )/T x ~ 23% 
consistent with previo us findings (jMathicsc n fc Evrardl 
I200U IRasia et al.ll2005ft . As pointed out by previous au- 
thors, this systematic shift is due to the superposition 
of cluster gas with different temperatures along the line 
of sight. To verify that this explains our result, we per- 
formed a test for a set of beta-model clusters whose gas 
is composed of two different temperatures, with varied 
cool-gas fractions. We found that T ew is higher than Tx 
in all cases. We can understand this result by express- 
ing these temperature measures in the following way: 
T = J W{T)TdV/ J W(T)dV, where W(T) is a weight- 
ing function. For T ew , W{T) = A(T) cx VT if dominated 
by bremsstrahlung emission. Assuming Tx can be ap- 
proximated by t he spectroscopic-l ike temperature, then 
W(T) ~ T- 3 / 4 ()Rasia et al.ll2005D . that is, T x tends to 
weight more on the cool gas in a cluster and thus is sys- 
tematically lower than T ew in general. 

To see whether this systematic shift is dependent on 
merger types, we also plot in the right panel the normal- 
ized distributions of deviations from the best-fit relation 
for relaxed clusters, minor mergers, and major mergers. 
Comparing their distributions using the Wilcoxon Rank- 
Sum test shows that the systematic shift between Tx 
and T ew is similar for relaxed clusters and minor merg- 
ers, but is larger at a statistically significant level for 
major mergers, based on our merger definitions. T his is 
also in agreement withrMathicsc n fc Evrardl (|2001f ). who 
found that the difference between Tx and T ew is larger 
for merging clusters because the spectral fit is largely 
contributed by X-ray photons from the bright and cool 
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Fig. 1. — Left: 7x,500 vs - Tern, 500 for clusters with M500 above 2 X 10 Mq at 2 = 0. The relaxed clusters, minor mergers and major 
mergers are plotted using grey, blue, and red symbols, respectively. The solid line is the best-fit relation. Comparing to the dotted line 
with a slope of 1 shows that T ew is biased high with respect to Tx ■ Right: Normalized distributions of deviations from the best-fit relation 
in Tx 500 vs - T eu)j 5oo scaling for relaxed clusters (black), minor mergers (blue), and major mergers (red). 



accreted subclumps. 



3.3. Substructure 



measures 



High-resolution X-ray images of clusters taken with 
Chandra and XMM-Newton have revealed disturbed 
ICM structures such as s hocks, bubbles, and cold front s 
(e.g. iBirzan et aT] 12004 Mailman fc Markevitchl I2004D . 
Various substructure measures have been us ed to quan- 
tify t he irregularity, such as centroid offset ([Mohr et aTl 
119951 ) and power ratios (|Buote k Tsaj|ll995l 119961) . In 
previous work this irregularity is often assumed to be 
associated with mergers. To examine the effectiveness 
of the substructure measures and the adequacy of us- 
ing them as indicators of dynamical state, we calculate 
the centroid offset and power ratios in addition to other 
theory-based definitions of cluster mergers in our simu- 
lation. 

There are many ways to define the centroid offset. For 
observed clusters, it can be defined as the variance in 
the centroids of cluster re gions above several surface- 
brightness isophotes (e.g., lO'Hara et alJ 120061 ). Since 
the offset is essentially a measure of the distance be- 
tween the surface brig htness peak and the cluster cen- 
troid, iKav et ail (|2007l ) used a simpler definition for their 
simulated clusters, 
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where i?s,maa; is the position of the surface bright- 
ness peak and i?s !Ce n is the surface-brightness centroid. 
We compared the centroid offsets calculated using both 
methods and found that these two definitions give sim- 
ilar results. We will use the latter definition hereafter 
since it is more strongly correlated with the power ratios 
for our simulated clusters. 

Power ratios are the multipole moments of surface 
brightness measured within a circular aperture centered 
on the cluster's centroid. The moments, a m and b m (de- 
fined below), are sensitive to substructures in the surface 



brightness distribution. This method is motivated by the 
multipole expansion of the two-dimensional gravitational 
potential, 

1 



y(R,<t>) = -2Ga ln[- 



2G — — (a m cosm<f> + b m smmcf)) . (2) 
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The moments a m and bm are 



a m (R) = / X(x)(R') m cos mcf)'d 2 x', 

JR'KR 

b m (R)= I Y,{x , ){R') m sinm(j)'d 2 x l , (3) 
Jr'<r 

where x' = (R',<j)'), R is the aperture radius, and X is 
the surface mass density, or surface brightness in the case 
of X-ray observations. 

The mth power P m is the azimuthal average of the am- 
plitude of \I/ m , the mth term in the multipole expansion 
of the potential given in equation J2]| , 

1 f 2v 

P ™( R ) = ^J { ^m{R, <t>)^m(R, 4>)d4>- (4) 

For m — and m > 0, we have 
P = [a Q ]n(R)} 2 
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respectively. The power ratios are thus P m /Po, the mth 
power normalized by the flux within R. 

For each cluster we compute P2/P0, P3/P0, and 
P(!)/Po. The quantities without parenthetical subscripts 
are evaluated about the surface brightness centroid 
(therefore Pi vanishes by definition). The quadrupole 
power, P2, is related to the degree of flattening or ellip- 
ticity. The next odd moment, P3, is sensitive to unequal 
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bimodal structures. Pm/Po, which is calculated about 
the surface brightness peak, measures gas distribution 
around the peak and thus is similar to the centroid off- 
set. Since the power ratios are sensitive to substructures 
at the scale of the aperture radius R, each of the power 
ratios is calculated using three different radii, P500, R200, 
and 1 Mpc, for each cluster. 

4. RESULTS 
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Fig. 2. — M500— Tx,500 relation for clusters with M500 above 
2 X 10 13 Mq at z = 0. Clusters identified by at least one of the five 
merger diagnostics are plotted using filled circles; relaxed clusters 
are plotted using open circles. The error bars in Tx (< 0.5%) are 
smaller than the symbol size and are thus omitted. The solid line 
is the best-fit relation for all clusters. 



Figure [2] shows the M^q-Tx^qo relation for clusters 
with A/500 above 2 x 10 13 M Q at z = 0. All quanti- 
ties are viewed along the x-direction in the simulation 
box. The best-fit relations for all clusters as well as dif- 
ferent subgroups are given in Table [TJ Note that the 
best-fit slopes for relaxed and merging clusters are dif- 
ferent. The difference in the slopes implies that there 
is variation in the slopes derived from observed clusters 
using different selected subsamples. This systematic un- 
certainty in the slope, which is ~ 5% by comparing the 
slopes of the merging and relaxed clusters in Table [IJ will 
contribute to the uncertainties in cosmological parame- 
ters. For example, it will translate into a systematic 
uncertainty of ~ 3% in <t 8 when using observed cluster 
samples including and excluding merging clusters. This 
is relatively small compared to the current level of uncer- 
tainty (~ 10%) in determining the cosmological parame- 
ters, but as the constraints are improved to a few percent 
in the future, one has to keep it in mind when comparing 
results using different cluster selection criteria. 

While we are able to rep roduce the self-similar M 200- 
T ew scaling relation (e.g.. iBrvan fc Norman! fl998l ) . the 
slope and normalization of the M^qq-Tx^oo relation are 
both smaller compared to observed values. This devia- 
tion is mainly due to missing baryo nic physics and the 
use of different ma ss estimates (e.g.. iBorgani et aT1l2004t 
iNagai et alj|2007 a). Matching the slope and normaliza- 



tion of the scaling relations with observed values is a sub- 
ject of great interest on its own and will be investigated 
in our future papers when we incorporate more realis- 
tic models for the baryonic component. For the purpose 
of this paper, we will focus on the scatter in these rela- 
tions and the relative importance of clusters of different 
merger histories. 

TABLE 1 

Best-fit parameters in the mass-temperature relations, 

log(Tx,50o) = a + felog(Af 50 o), FOR CLUSTERS AT z = 0. ERRORS 
IN THE PARAMETERS ARE THE la ERRORS. 



Subgroup 


Count 


a 


b 


All 


619 


-8.067 ± 0.004 


0.5842 ± 0.0003 


Relaxed 


478 


-7.981 ± 0.004 


0.5780 ± 0.0003 


Merging 


141 


-8.409 ± 0.008 


0.6093 ± 0.0006 


Minor 


55 


-8.379 ± 0.013 


0.6068 ± 0.0009 


Major 


86 


-8.050 ± 0.011 


0.6163 ± 0.0008 



4.1. Distribution of intrinsic scatter 

Figure [3] shows the normalized distribution of the 
logarithmic deviations of temperature from the best-fit 
AZ500 - T->s:,500 relation at z = 0. The RMS scatter for our 
whole sample is 6.10%, which is smaller than the values 
obtained b y simulation s with cooling and heating, such 
as 13.6 % in lNagai et all (|2007aft and 20% in lO'Hara et~aTl 
(2006), who also used the spectroscopic temperature and 
true mass. It is difficult to directly compare with ob- 
served values because the observationally-estimated in- 
trinsic scatter is dependent on how the measurement er- 
rors are determined and th us displays a wide range in 
the literature, from 3.9 % in (|Arnaud et al.ll2005[) to 17% 
in (|Q'Hara et al.ll200ll . 

In the left panel of Figure [3] we compare the distri- 
butions of merging (red) and relaxed (black) clusters, 
while the right panel plots for minor and major mergers 
individually. The two distributions on the left display 
no apparent difference in their mean values. The stan- 
dard deviation for merging clusters appears to be larger 
than that of the relaxed clusters. To test the hypothe- 
sis that the merging and relaxed distributions differ, we 
performed the Wilcoxon Rank-Sum (R-S) test and the F- 
variance (F-V) test to see whether these two populations 
have significantly different mean values or variances, re- 
spectively. A small value (< 0.05 for a significance level 
of 5%) returned by the tests is often adopted to indicate 
a significant difference between these two populations. 
We performed the tests on different merging subgroups 
and summarize the results in Table [21 

The R-S test results show that the mean values do not 
differ significantly among all populations, which means 
that the intrinsic scatter is unbiased for merging and re- 
laxed clusters at z = 0. The F-V tests for all mergers 
and minor mergers show that their standard deviations, 
or the amount of scatter, are significantly larger than 
that of the relaxed ones. For clusters at z = 1, there is 
also no bias between merging and relaxed populations. 
The amount of scatter for merging clusters also tends 
to be greater than relaxed clusters, although only major 
mergers show a significant result. We will discuss the 
possible reasons for this trend in the next two sections. 
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Fig. 3. — Left: Normalized distribution of log scatter in the M5oo-Tx,5uo relation at z = for merging and relaxed clusters. The curves 
are the best-fit lognormal distributions. Merging clusters are plotted using open triangles and dashed lines; relaxed clusters are plotted 
using filled circles and solid lines. Right: Normalized distribution of log scatter and the best-fit curves for minor mergers (open squares 
and dotted lines), major mergers (open traiangles and dashed lines) and relaxed clusters (filled circles and solid lines). 



Since the form of scatter can affect the observed scaling 
relations, we also test the Gaussianity of the distribution 
of scatter in log space by fitting it with a Gaussian curve 
and using the Kolmogorov-Smirnov (K-S) test to see if 
the two distributions differ significantly. Again, a small 
value represents a significant deviation from the Gaus- 
sian distribution. The test results show that the distri- 
butions of scatter for all populations but minor merg- 
ers differ from a lognormal distribution at a significant 
level. There is an even stronger signature of deviation 
from lognormal at z = 1. We note that the deviation of 
the scatter from a lognormal distribution may affect re- 
sults from some of the self-calibration studies assuming a 
lognormal distribution of scatter (e.g.. lLima fc Hull2005f ) 
and should be taken into account to correctly interpret 
the obtained constraints on the cosmological parameters. 

4.2. Intrinsic scatter vs. halo concentration 

In the following two sections we will investigate the 
physical origin of the M^qq-Tx,5oo scatter by correlating 
it with cluster properties that are related to how the 
clusters are formed. In particular, we first show that the 
intrinsic scatter depends strongly on halo concentration. 
Then we discuss the contribution of scatter from recent 
merging events in the following section. 

Figure [5] shows a strong positive correlation between 
the scatter in the M^qq-Tx,5oo relation and the scat- 
ter in the M5oo-(i?20o/-Rsoo) relation. The correlation 
coefficient is 0.64, with a probability of zer o, given by 
the Spearman Rank-Order Correlation test (|Press et al.l 
f992, §14.6; probability of one means no correlation). 
To ensure that this result is not biased by the lower- 
mass clusters whose -R500 values are close to the reso- 
lution of the simulation, we raised the mass threshold 
to M500 > 10 14 M Q and repeated the analyses for these 
well-resolved systems. We found that for the 67 selected 
massive clusters, the correlation still holds, with Spear- 
man correlation coefficient 0.36 and probability 0.003. 
Note that we correlate with dlog(i?2oo/^50o) instead of 



the raw value of -R200 / -R500 because the latter is a func- 
tion of cluster mass. By doing so we exclude the effect of 
different cluster masses, focusing on the variation in halo 
concentrations. i?20o/^?500 is a monotonically decreasing 
function of the halo concentration parameter, usually de- 
fined as c = i?2oo/^?s, where R s is the scale radius of a 
cluster. Therefore, for clusters with similar masses, more 
concentrated clusters tend to lie under the mean mass- 
temperature relation, while the puffier clusters tend to 
scatter high. 

The relation between -R200/-R500 and c is less obvi- 
ous, so we derive their relat ion assumin g an NFW profile 
(Navarro, Frenk, & White fl995t fl996l . hereafter NFW) 
in the following. For a cluster that has an NFW density 
profile, the mass enclosed within a normalized radius of 
x = R/Rs is 



M(< x) = 4tt Ps R 3 s 



ln(l 



Anp s R 3 J(x), 



(G) 



where p s is the density at the scale radius R s . Also, the 
mass in the spherical overdensity definition is 



M A 



-in 



Ap crit (xAR s y 



(7) 



where A is the overdensity and p cr it is the critical density 
of the universe. Equating equation ^ and ([7]) gives 

J(XA) Pent 

which is a constant for the cluster under consideration. 
Therefore, the relation between £500 and £200 (or c, recall 
c = R2oo/Rs) is 



[ -. r ion 



7(2:500) 5 /(c) 



(9) 



We can use this relation to numerically solve for X500 as 
a function of c, and then the relation between i?2oo / R50Q 
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TABLE 2 

Significance tests on the distribution of scatter for different populations at z = and z = 1. The R-S and F-V test 

RESULTS FOR EACH SUBGROUP ARE RELATIVE TO THE RELAXED CLUSTERS. 



Subgroup 


z 


N 


Mean 




R-S Test 


0"rms(%) 


F-V test 


K-S test 


Relaxed 





478 


1.31 


X 


10" 


-2 




5.87 




2.5 


X 


io- 


-2 


Merging 





111 


8.67 


X 


io- 


-3 


0.134 


6.90 


0.014 


2.5 


X 


10" 


-2 


Minor 





55 


1.11 


X 


io- 


-2 


0.146 


7.21 


0.029 


2.0 


X 


10" 


-1 


Major 





86 


7.09 


X 


10" 


-3 


0.253 


6.72 


0.086 


2.5 


X 


io- 


-2 


Relaxed 


1 


102 


7.14 


X 


io- 


-3 




4.85 




2.6 


X 


io- 


-4 


Merging 


1 


121 


8.72 


X 


io- 


-3 


0.493 


5.71 


0.091 


6.5 


X 


10" 


-3 


Minor 


1 


46 


1.41 


X 


10" 


-2 


0.230 


5.25 


0.504 


8.0 


X 


10" 


-2 


Major 


1 


75 


5.45 


X 


io- 


-3 


0.290 


5.98 


0.049 


6.5 


X 


10" 


-3 




and c is simply 



-R500 2:500 (c) 

This relation is plotted in the right panel of Figure [4j 
We choose to use the parameter -R200/-R500 instead of 
the original halo concentration parameter c because it 
has two advantages. The first is to avoid introducing the 
uncertainty in fitting an NFW profile, especially for less 
massive clusters, since the fitting is very sensitive to the 
resolution in the central region of the cluster. Moreover, 
our analyses involve not only relaxed clusters but also 
merging ones, for which -R200/-R500 is actually more well- 
defined than c. 

One can understand why the correlation between the 
-^500 _ ^x,500 scatter and the halo concentration exists 
using the virial theorem. Consider the simplest case for 
an isolated system: 2T + W = 0, where T and W are 
the total kinetic and gravitational binding energy of the 
system, respectively. Then in general, 

ksTyir GM v i r , , 



where ks is the Boltzmann constant, /1 is the mean 
molecular mass of the gas, m p is the mass of a proton, 



and T V i r , M V i r , and R V i r are the virial gas temperature, 
mass, and radius of the system. Since the relations be- 
tween the virial quantities and the quantities in the over- 
density definition depend on individual cluster profiles, 
or halo concentrations, the Ma-Ia relation derived from 
above would have a normalization which is a function of 
concentration. This is why we expect the scatter in the 
-^500 _ 2x,500 relation to correlate with the concentration 
parameter. 

Halo concentrations have been shown to be r elated 
to the epoch at wh ich the hal o for med (NFW 119971 : 
IWechsler et "all 2002: IZhao et~aT1 2003; iNeto et al.1 2007). 
Since we have the mass assembly histories of all the sim- 
ulated clusters, it is straightforward to derive the clus- 
ter formation time based on the definition that a cluster 
"forms" when it first exceeds a certain fraction of its 
final mass. The commonly-adopted thresholds include 
10%, 25%, 50%, and 70%. Since more concentrated ha- 
los tend to form at higher redshift, a negative correla- 
tion between the M^q-Tx^oo scatter and the formation 
redshift is expected. Indeed we find significant negative 
correlations with the time when the cluster first reached 
one half, £1/2, and one quarter, £1/4, of its final mass 
(see Figure [S]). These correlations are not as tight as the 



0.6 



0.4 - 



0.2 



g" 0.0 



-0.2 - 



-0.4 - 



-0.6 








4 6 8 

L 1/2,lookbock 

(Gyr/h) 



10 



12 



0.6 



0.4 



0.2 



S 1 0.0 



-0.2 



-0.4 



-0.6 



1 

- 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

- 


- 


• 1,1,."" , ■* 

£> v.': . 

— - — j iH^4^^J^^^g^ ' 






. . . 1 . . 


. i ... i ... i ... i .. . 



4 6 8 

11 1 /4,lookbock 

(Gyr/h) 



10 



12 



Fig. 5. — Left: M&oo—Tx &qq scatter correlated with cluster formation time, defined as the time when a cluster first obtained half of 
its final mass. The Spearman correlation coefficient is -0.14, with a probability of 1.95 X 10 — 4 (probability of one means no correlation). 
Right: Same plot with x-axis being the time a cluster reached 1/4 of its final mass. The correlation coefficient is -0.11, with a probability 
of 9.4 X 10 — 3 . Both plots show that clusters that are formed at earlier/later times tend to scatter low/high. 
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Fig. 6. — M500— Tjt,500 relation corrected for halo concentrations. 
Notations are the same as in Figure [2] 

one with halo concentrations, probably due to the fact 
that the correlation between the halo concentration and 
the cluster formation time itself has a very large scat- 
ter, and also that the variation in halo concentrations 
cannot be fully accounted for by the variation in cluster 
formation time (|Neto et all 2007) . But the significance of 
these correlations with cluster formation times supports 
our finding that the scatter correlates with halo concen- 
trations. Therefore, we can say that cluster assembly 
histories leave imprints on the shapes of clusters at the 
present day that help to determine clusters' positions on 
the mass-observable scaling relation. 

The strong correlation in Figure [4] suggests that the 
variation in halo concentrations contributes to a signifi- 
cant amount of the intrinsic scatter in the Mbqq— Tx,500 
relation. It also implies that when we are provided the 



best-fit relation, dlogTx = 1.81 x d log (-R20o/-Rsoo) ; it 
is possible to use dk>g(i?2oo/i?50o) as a third parameter 
to normalize Tx in the M^qq-Tx^oo relation, and thus 
reduce the scatter. After removing the effect of halo con- 
centrations, we find that the RMS scatter decreases from 
6.10% to 4.49%, a reduction of ~ 26% of its original 
value. The corrected M^qq-Tx ,500 relation is shown in 
Figure [H 

We can explain the trend found in the previous sec- 
tion using this correlation too. Figure [7| shows the dis- 
tributions of halo concentration for merging and relaxed 
clusters at z — and 2=1. For both rcdshifts, the 
distribution of halo concentration for merging clusters is 
more dispe r sed th an for relaxed clusters, as also found in 
iNeto et ail (|2007l ). If the variation in halo concentrations 
is important to the M 500 -Tx,5oo scatter, as the above 
correlation suggests, then it is reasonable that merging 
clusters have a greater amount of scatter than relaxed 
ones. One may try to relate this trend to the dynami- 
cal state of clusters because the temperature excursions 
during mergers could also drive the scatter. However, in 
the next section we will show that this is not the case. 

In summary, we have found that the scatter in the 
Msqq-Tx^oq relation partly originates from the variation 
in halo concentrations, with more concentrated or early- 
formed clusters lying below the mean (they are cooler), 
while puffier clusters that are formed recently tend to 
be hotter than clusters with similar masses. Using the 
strong correlation between the scatter and halo concen- 
trations, the scatter can be greatly reduced to get a much 
tighter relation to be used for cosmology. The correlation 
can also explain the trend seen in the previous section 
that merging clusters have a greater amount of scatter 
than relaxed ones. Note that our simulation adopted the 
value of as = 0.74 from WMAP3 results, which is smaller 
than 08 = 0.796 from WMAP5. Although choosing a 
smaller rj 8 would decrease the average concentrat ion of 
clusters with a fixed mass (e.s. IDuffv eTaLl 12008). it is 



9 





Fig. 7. — Distribution of R200/-R500 for relaxed (solid) and merging (dashed) clusters at z = 
According to the F-V test, merging clusters have larger dispersions than relaxed clusters for both 2 
0.0044 and 0.0367, respectively. 



(left panel) and 2 = 1 (right panel). 
= and 2 = 1 datasets, with significance 



the variation of concentration that correlates with the 
scatter. Therefore the correlation should still hold if a 
higher og is used. 

4.3. Intrinsic scatter vs. recent mergers 

The other possible origin of the scatter in our simula- 
tion is the departure from hydrostatic equilibrium due to 
cluster mergers. Cluster mergers are among the most en- 
ergetic events in the universe. When two clusters merge, 
their gas is compressed and heated by merger shocks. 
This effect can boost the luminosity and temperature of 
the cluster a few times higher than its pre-merger value, 
as found in ideal merg er simulations (jRicker &; Sarazinl 
[20011: iPoole et~aI1f2007n . Therefore, our aim is to investi- 
gate how merger events statistically influence the cluster 
scaling relations. 

In order to see how merging events influence the ob- 
servable quantities of clusters, we correlate their mass- 
temperature scatter with their dynamical state. Two dif- 
ferent methods are used to quantify the dynamical state. 
The first is based on the actual cluster merger histories, 
where we use the time since last merger as an indica- 
tor. The second one is motivated by observations that 
unrelaxed clusters often have more substructures than 
relaxed clusters. We discuss results using both methods 
in the following. 

First we correlate the M^qo-Tx^oo scatter with the 
time since last merger, ti as t (thus clusters that just un- 
derwent mergers would have ti ast = 0). As shown in 
Figure there is a trend for more recently merged clus- 
ters to lie below the mean relation, but only the corre- 
lation for major mergers has a high probability. Note 
that although the scatter here is uncorrected, this cor- 
relation is not due to the effect of halo concentrations 
because halo concentrations work in the opposite direc- 
tion, as described in § 14.21 Therefore, this trend may be 
due to clusters that have just merged with a cool clump 
and are still on their way to virialization, as illustrated 
by the following example. Figure [9] is the time history 



constructed from our merger tree analysis for a cluster 
undergoing a minor merger. Here we plot the evolution 
of mass, temperature, and substructure measures ver- 
sus lookback time (t — for today). We can see that at 
t = 1.3 Gyr, a clump of cold gas merged into the primary 
cluster. The cold accreted gas caused a jump in mass and 
substructure measures, but it reduced the average tem- 
perature of the cluster. This behavior thus tends to make 
merging clusters lie below the Afsoo - 3jc,500 relation when 
they have just merged and then gradually move up when 
they become virialized. 

This trend is also seen when we correlate scatter with 
some of the substructure indicators, although the prob- 
abilities are not high. Figure [TO] shows the correlation 
with the two substructure measures that give the high- 
est probabilities, P2/P0 an d P3/P0 measured with a fixed 
aperture radius 1 Mpc. Here we use a fixed aperture size 
because this gives power ratios that are weaker functions 
of cluster mass than substructures computed using i?2oo 
or iisoo- This is to minimize covariance with halo con- 
centrations through the mass dependence. This negative 
correlation between scatter and substructures again sup- 
ports the idea that merging clusters tend to be cooler 
than clusters with similar masses. 

But do recent mergers cause a problem statistically? 
Again we want to compare the distributions of scatter 
for the merging and relaxed populations in the same way 
as in § 14.11 But as discussed earlier, the trends are proba- 
bly dominated by the effect of halo concentrations. Since 
we want to study the effect of recent mergers in isolation 
from other effects, we compute the statistics using the 
-^500~2x,500 relation after correcting for halo concentra- 
tions (Figure [6]) instead of the raw relation. In this way 
we can see whether the dynamical state of clusters is the 
second dominant factor in the scatter. 

The distributions of the concentration-corrected scat- 
ter for merging and relaxed clusters at z — are shown 
in Figure [TT] Again we use the R-S test and F-V test to 
detect whether there is any difference in the mean val- 
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Fig. 8. — M$oo-Tx 500 scatter versus the time since last merger for major mergers (left panel) and minor mergers (right panel) at z = 0. 
The Spearman correlation coefficients are 0.24 and 0.008, with probabilities of no correlation being 0.02 and 0.95 for major and minor 
mergers, respectively. 



ues and variances of these two populations. The results 
are summarized in Table [3l We find that after removing 
the effect of halo concentration, the trend that merging 
clusters have a larger RMS scatter becomes insignificant. 
This supports our earlier statement that the behavior 
of the raw scatter is determined more by the distribu- 
tion of halo concentrations than by the dynamical state 
of clusters. As for the mean values, there is a signif- 
icant relative bias for merging clusters to have smaller 
means than relaxed clusters at z = 0. This bias can be 
due to the incomplete virialization of merging clusters 
we just described. However, it is also possible that we 
have over-corrected for the halo concentrations for merg- 
ing clusters in comparison with relaxed clusters, because 
merging clusters are generally less concentrated (or have 
larger i? 2 oo/^50o, see Figure [7]). 

In summary, by correlating the scatter with the dy- 
namical state of clusters using both merger tree and sub- 
structure analysis, we find a very weak trend that merg- 
ing clusters tend to be cooler than relaxed clusters with 
similar masses. But this trend has a minor effect on the 
statistical properties of the scatter, i.e., the scatter for 
the merging clusters is neither biased nor wider spread 
compared to the relaxed ones at z = and z = 1 (see 
Table [3]). This implies that the dynamical state is not 
the second most important factor that contributes to the 
scatter, but that still other sources need to be found. 
However, the fact that the distributions of scatter for 
the merging and relaxed clusters are indistinguishable 
even out to higher redshift is good news for using cluster 
scaling relations in cosmology. 

5. DISCUSSION 

5.1. Effect of dynamical state 

In the previous section we have shown that the dynam- 
ical state of clusters has very little influence on the overall 
scatter in the M^qq-Tx^oo relation. When we look at the 
merging population, there is even a tendency for merging 
clusters to be cooler than relaxed ones of similar masses. 



Although this can be explained by incomplete virializa- 
tion of clusters merging with a cooler clump, it is still 
somewhat contrary to our intuition that merger shocks 
can heat the intracluster medium and raise the tempera- 
ture of a merging cluster. We discuss the possible reasons 
for this result in the following. 

One reason is that sometimes the merger shock is not 
captured by the projected -R500 aperture. At the begin- 
ning of mergers, the shocks often occur in the outskirts of 
clusters. So in order for the shock to be captured inside 
-R500, either the shock has to propagate into the i?5oo re- 
gion of the main cluster, or the two clusters have to merge 
roughly along the line of sight in order to affect Tx,5oo- 
The cluster history shown in Figure [T^] is one example 
of such a case. At t = 1.3 Gyr, the substructure mea- 
sures increase, indicating the start of the merger event. 
The maximum temperature in the cluster is increased by 
the merger shock, but Tx,5oo is unaffected because the 
shock-heated gas lies beyond the projected aperture ra- 
dius i?5oo- Even if the merger shock is within -R500, the 
spectroscopic temperature is not as sensitive to shocks 
as the emission-weighted temperature because when the 
shock and other cooler gas in the cluster are projected 
along the line of sight, th e spectral fit tends to put more 
weight on the cooler gas (jMazzotta et al.|[2004D . 

Secondly, the duration of the temperature boost 
is typically only ~ 0.5 Gyr (Ric ker &: Sarazinl 120011 : 
iPoole etltll 120071 ). and hence only a fraction of the un- 
relaxed clusters are observed during the transient excur- 
sion period. Moreover, even for those clusters that have 
just undergone major mergers, the increase in mass and 
temperature are often comparable. For example, a 3:1 
merger would have a mass jump of Mf/Mi ~ 1.33 and 
a temperature jump of Tf /Ti ~ 2 depending on the im- 
pact parameter of collision. Thus clusters tend to evolve 
roughly parallel to the scaling relation, as also suggested 
by previous works (e.g. IPoole et al.ll2007t ). 

Finally, merging clusters are the minority population 
compared to relaxed clusters. The fraction of clusters 
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Fig. 9. — Example time history of a cluster that undergoes a minor merger. The solid (dashed) line is M500 (Mf a f) in the first panel, 
T500 (Tmax) in the second panel. The other panels show the evolution of the substructure measures. 



that had a merger within the past 3 Gyr is ~ 23% at 
z = and ~ 54% at z = 1, while major mergers are rarer. 
All these effects combined are responsible for diluting 
the influence of mergers on the scatter and making their 
distributions indistinguishable from relaxed clusters. 

5.2. Effectiveness of substructure indicators 

Substructure measures, such as centroid offsets and 
power ratios, have often been use d in earlier studies 
to identify unrelaxed clusters fe.g. O'Hara et all 120061 : 
iKav et all l2007t Ueltema et all |200J). Observationally, 



they are quantities that link most effectively to the dy- 
nami cal state of clusters (jMohr et al.lll993l : lBuote fc Tsail 
1996). In simulations, they are easy to derive and to com- 
pare with observations. However, they still have some 
limitations, such as the projection effect. 

In our study we have extended the analysis of the dy- 
namical state of clusters by constructing cluster merg- 
ing histories, because these provide more information 
about the true dynamics of clusters during mergers than 
morphology-based measures, which are subjected to ob- 
servational limitations. This is the first time that these 
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Fig. 10. — M5oo-Tx,50o scatter versus substructure measures P2/P0 and Ps/Pq. Relaxed/merging clusters are plotted in open/filled 
circles. Weak negative correlations are found for merging clusters with probabilities of no correlation being 0.08 for P2/P0 an d 0.34 for 
P3/P0. 
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Fig. 11. — Distributions of scatter at z = after correcting for the effect of halo concentrations. In the left panel, merging clusters are 
plotted using open triangles and dashed lines, and relaxed clusters are plotted using filled circles and solid lines. In the right panel, the 
merging clusters are separated into minor mergers (open squares and dotted lines) and major mergers (open traiangles and dashed lines). 



TABLE 3 

Significance tests on the distribution of scatter after removing the effect of halo concentrations for different 
populations at z = and 2 = 1. the r-s and f-v test results for each subgroup are relative to the relaxed clusters. 



Subgroup 


z 


N 


Mean 




R-S Test 


OYms(%) 


F-V test 


Relaxed 





478 


1.44 X 10" 


-2 




4.42 




Merging 





141 


4.27 x 10" 


-3 


0.012 


4.67 


0.403 


Minor 





55 


6.39 x 10~ 


-3 


0.058 


4.63 


0.612 


Major 





86 


2.92 x 10~ 


-3 


0.034 


4.72 


0.406 


Relaxed 


1 


102 


1.03 x 10" 


-2 




4.25 




Merging 


1 


121 


6.08 x 10~ 


-3 


0.329 


4.88 


0.151 


Minor 


1 


46 


1.30 x 10~ 


-2 


0.275 


4.96 


0.202 


Major 


1 


75 


1.86 x 10- 




0.145 


4.81 


0.241 
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Fig. 12. — Example time history of a cluster that undergoes a major merger. The solid (dashed) line is M500 (Mf a f) in the first panel, 
T500 (Tmax) in the second panel. The other panels show the evolution of the substructure measures. 
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Fig. 13. — Substructure measures P2/P0 an d w in x projection versus the time since the last < 5 : 1 merger. The filled and open circles 
are merging and relaxed clusters identified by the merger tree diagnostic, respectively. The two horizontal dashed lines mark the upper 
and lower 20% of all clusters that have the highest and lowest substructure values. The right most column in the figure are clusters that 
did not have any < 5 : 1 merger in their entire histories. 



two approaches have been compared to see how effec- 
tively the substructure measures can recover the true 
merging population. 

Figure [T3] shows the negative correlations between two 
of the substructure measures as seen in the x projection 
of the simulation box and the time since the last < 5 : 1 
merger for z = 0. The filled and open circles are merg- 
ing and relaxed clusters identified by the merger tree di- 
agnostic, respectively. The two horizontal dashed lines 
mark the upper and lower 20% of all clusters that have 
the highest and lowest substructure values. We can see 
that, indeed, recently merged clusters tend to have more 
substructure than relaxed clusters. However, the merg- 
ing and relaxed populations overlap over a wide range of 
substructure values because of the large variation in sub- 
structures even for clusters at the same dynamical state. 
Therefore, there is not a clean cut to separate these two 
populations using the substructure measures. 

In order to facilitate future studies in this area, we com- 
pute the completeness and contamination of merging and 
relaxed clusters defined by the substructure measures, as- 
suming those identified using the cluster merger histories 
represent the true populations. We refer by 'complete- 
ness' to the fraction of merging/relaxed clusters found 
using substructure among the true merging/relaxed pop- 
ulation, while 'contamination' is the fraction of clusters 
that are detected as merging/relaxed but are actually re- 
laxed/merging. In Table S] we summarize the results for 
different substructure measures. In the definitions us- 
ing substructures, clusters that lie above/below the 20% 
thresholds are defined as merging/relaxed. In order to 
see how much the results are affected by the projection 
effect, we list the values obtained using information from 
both one and all projections. In the latter case, we select 
clusters by comparing the maximum value among three 
projections to the selection thresholds. In other words, 
clusters are identified as merging if the most disturbed 
value among three projections is above the 20% thresh- 



old, and relaxed clusters must have their most disturbed 
value below the lower 20% threshold. 

By comparing the numbers in Table [H we find that 
in general P2/P0 and P3/P0 give similar results, and 
P(i)/Po is n °t as useful as the other two power ratios. 
The centroid offset, having the greatest completeness and 
the least contamination, is the most successful one among 
all measures. When different aperture sizes are compared 
for the power ratios, the results using aperture sizes of 
1 Mpc and i?2oo are similar, while using -R500 is gener- 
ally a little worse than the others both in completeness 
and contamination. When all three projections are con- 
sidered, merging and relaxed clusters are better distin- 
guished for all measures, with the values of completeness 
and contamination changed by ~ 10 — 20%. The cen- 
troid offset, w, improves the most when all projections 
are used. 

According to the above analysis, the centroid offset 
does a better job in distinguishing merging and relaxed 
clusters than the power ratios. This is probably because 
each individual power ratio is only sensitive to a cer- 
tain type of substructure. They are more powerful when 
combi ned to distingu i sh be t ween different morphological 
types (jBuote fc Tsailll995l [199(1 . The centroid offset, 
on the other hand, is a more general feature of all dis- 
turbed clusters. However, all the substructure measures 
have limitations. Their effectiveness is influenced by the 
viewing projection. More importantly, their values have 
a large variation even for clusters at the same dynamical 
state, as shown in Figure [T31 Thus only ~ 40% of the 
true merging clusters are detected, and ~ 25% of the re- 
laxed clusters. Among the detected clusters, ~ 50% of 
the "merging" clusters are actually relaxed, and ~ 15% 
of the "relaxed" clusters are actually merging. There- 
fore, although substructure measures are useful in dis- 
tinguishing the dynamical state of clusters, caution is 
still required to interpret the results correctly. 
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TABLE 4 

Completeness and contamination of merging and relaxed clusters identified using power ratios and centroid offsets. 
Clusters lying above/below the 20% thresholds are defined as merging/relaxed. For the upper half of the table, the 
selection is based on the values of substructure measures in one projection, while for the bottom half, clusters are 
found by comparing the maximum value among three projections to the thresholds. See the text for details on the 

definition of completeness and contamination. 
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17.6 


11.2 
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12.8 


12.8 


15.2 


23.2 


25.6 


17.6 
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5.3. Comparison with previous work 

Several studies have explored the effect of clusters with 
different dynamical states on the M—Tx scaling relation. 
Most previous works used substructure measures such as 
the power ratios and the centroid offset to quantify the 
dynamical state. The main difference in our study is that 
we directly analyze cluster merging histories to identify 
the recently merged clusters, which provides another line 
of evidence for our results in addition to those derived 
fr om the substr u cture measures. 

O'H ara et al.l (|2006l ) investigated the effect of merg- 
ers and core structure on the X-ray scaling relations for 
both observed and simulated clusters. For the observed 
sample, they found that cool core clusters and clusters 
with less substructure exhibit a larger amount of scatter. 
Their simulated clusters, on the other hand, have a ten- 
dency to have a larger amount of scatter for clusters with 
more substructure, though they argued that the evidence 
is weak. Since their simulations also do not include radia- 
tive cooling, we can compare directly with their results 
without worrying about other baryonic effects. We also 
find the same trend that merging clusters, which we have 
shown to have more substructure, have a larger amount 
of scatter. However, we further explore the origin of the 
intrinsic scatter and find a strong correlation with the 
halo concentration of clusters. We also show that the 
trend seen above is due to the fact that merging clusters 
h ave a larger va r iation in their concentrations. 

iJeltema et al.1 ()2008l ) studied the correlation between 
cluster substructures and cluster observables using hy- 
drodynamical simulations with non-gravitational heat- 
ing and cooling. Despite the difference of input bary- 
onic physics in the simulations and the definition of Tx, 
they found no dependence on cluster substructures in the 
M — T ew relation when the true mass is considered. Al- 
though we find negative correlations between the scatter 
and some of the substructure indicators, the probabili- 
ties are not high. The lack of correlation supports their 
result. However, they reported that there is a signifi- 
cant trend for the relaxed clusters to have lower temper- 
atures for their masses in the M — T ew relation measured 
within i?5oo, whereas we do not fi nd any significant bias , 
and an opposite trend is found bv lKravtsov et ail ([2006) , 



iNagai et al.l (|2007ah and iKav et ail (|2007ft . Taking the 
average relations bet ween T ew and Tx for merging and 
relaxe d clusters from lNagai et al.1 (|2007bl ). [jeltema et al.1 
( 2008) argued that the discrepancy cannot be explained 
by using different temperature definitions. However, we 
find that the bias between T ew and Tx is larger for merg- 
ing clusters than relaxed ones (see § 13. 2| , in the direction 
that can alleviate this discrepancy. Theref ore, the differ- 
ence b etween the conclusions reached by IJeltema et al.l 
(2008) and the others regarding the offset of merging and 
relaxed clusters may be due to the use of T ew instead of 

the spectroscopic tem perature Tx- 

On the other hand, fVentimigli a et al.l (|2008h have re- 
cently found significant negative correlations between 
the substructure measures and the scatter in the mass- 
temperature relation for their simulated clusters, both 
for the emission-weighted temperature T ew and the 
spectroscopic-like temperature T s i. They also found rel- 
ative offsets between merging and relaxed clusters in the 
mean scaling relation, that is, merging clusters tend to 
be cooler than relaxed clusters of similar masses. As 
discussed in their p aper, the different conclusion than 
IJeltema et all (|2008l ) may come from different implemen- 
tations of feedback mechanisms. Although the trends can 
be explained by incomplete relaxation of merging clus- 
ters, which we also observed, we do not find a signifi- 
cant separation in the normalization of the M — Tx rela- 
tion between merging and relaxed populations, especially 
when the intrinsic scatter without cooling is largely con- 
tributed by the effect of halo concentration. Therefore, 
the main reason resulting in the differences is probably 
due to radiative cooling, as is suggested by the fact that it 
is included in all the studies which observed the relative 
offset. Since current simulations with radiative cooling 
tend to produce relaxed clusters with steeper tempera- 
ture profiles than real clusters, the average temperature 
of relaxed clusters can possibly be biased high. This can 
explain why they found relaxed clusters hotter than ex- 
pected while our results do not show any significant offset 
between relaxed and merging clusters in the M — Tx re- 
lation. 

6. CONCLUSIONS 
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Galaxy clusters are invaluable cosmological probes. 
Accurate measurement of cluster masses is crucial and 
often relies on the mass-observable relations. However, 
to constrain the cosmological parameters to the few per- 
cent level, the systematics and scatter in these relations 
must be thoroughly understood. In this work we investi- 
gate the sources of intrinsic scatter in the M—Tx relation 
using a hydrodynamics plus N-body simulation of galaxy 
clusters in a cosmological volume. In order to compare 
directly to observations without worrying about different 
definitions or systematics, we produced mock Chandra 
X-ray images using MARX and extracted the spectro- 
scopic temperature Tx as observers do. Also, all the 
quantities in our analysis and discussions are measured 
within -R500, which is usually used for X-ray data. Ra- 
diative cooling and heating mechanisms are not included, 
since we would like to disentangle the scatter driven by 
the gravitational effects from other baryonic physics. We 
chose to focus on the M — Tx relation for several reasons. 
First, it is less sensitive to resolution and to cooling and 
heating mechanisms than other scaling relations, such as 
the Lx — Tx relation. Therefore, our results are still 
representative of reality even though the input physics 
is not complete. Second, the relative insensitivity to ad- 
ditional baryonic physics provides a window into better 
understanding of the physical origin of the scatter, de- 
spite our incomplete knowledge regarding the cooling and 
feedback mechanisms. Moreover, the intrinsic scatter in 
the M — Tx relation is among the smallest of all the 
observed scaling relations. If one can further reduce it 
based on the knowledge of its physical origin, the M — Tx 
relation will be extremely useful for cluster cosmology. 

Our aim is to find out what determines the positions 
of clusters in the M5Q0— Tx,500 relation, in particular 
whether the intrinsic scatter is driven by recent merg- 
ing activity or the overall assembly histories of clusters. 
We split our simulated cluster samples into merging and 
relaxed subgroups based on our merger tree analysis, and 
then we compare the distributions of the intrinsic scatter 
for individual subgroups. We also correlate the scatter 
with quantities that are related to the recent merging ac- 
tivity or cumulative cluster assembly histories, including 
the time since last merger, substructure measures, and 
the halo concentration. Here we summarize our findings. 

We find a strong correlation between the scatter in the 
-^500 - Tx,500 relation and the halo concentration. More 
concentrated clusters tend to lie below (cooler than) the 
mean relation, while puffier clusters tend to be hotter 
than expected for their masses. This is confirmed by the 
negative correlation between scatter and the formation 
lookback time of clusters, since it is well known that more 
concentrated clusters tend to form at earlier times. We 
showed that using this correlation, the scatter can be 
effectively reduced from 6.10% to 4.49%. 

There is no bias in the M^qq-Tx^oo relation between 
merging and relaxed clusters, but the amount of scatter 
for merging clusters is larger than that for the relaxed 
ones. This trend can be explained by the fact that merg- 
ing clusters have larger variations in their concentrations 
than relaxed clusters. 

When we correlate the scatter with the dynamical state 
of clusters, either the time since last merger or the sub- 
structure measures, there is a weak trend for recently 
merged clusters to be cooler, probably due to incomplete 



virialization of clusters that have just merged with a cool 
clump. However, statistically the influence of departure 
from hydrostatic equilibrium of merging clusters is neg- 
ligible. Possible reasons are discussed in detail in § 15.11 

There are significant deviations from lognormality of 
the distributions of scatter for our simulated clusters at 
both z = and z = 1. This effect should be taken into 
account in future self-calibration studies to correctly in- 
terpret the obtained constraints on the cosmological pa- 
rameters. Future simulation studies of larger volumes 
are needed in order to accurately characterize the distri- 
bution of scatter, including the tails of the distribution. 

In conclusion, we find that when radiative cooling and 
feedback mechanisms are neglected, the intrinsic scatter 
in the M^qq-Tx,5oo relation is driven more by the varia- 
tion in halo concentrations, or the overall assembly his- 
tories of clusters, than the recent merging events. Using 
an analytic approach, Bal ogh et al.l (|2006f) investigated 
whether the amount of scatter in the observed M — Tx 
and M — Lx relations can be explained by the variations 
in halo concentrations or different entropy floors. Al- 
though they focused more on the scatter in the M — Lx 
relation and showed that it requires a wide range of en- 
tropy floors, a significant portion of the scatter in the 
M — Tx relation is determined by the range of halo con- 
centrations predicted in their model. This is confirmed 
by our results since we have explicitly shown the strong 
correlation between the M — Tx scatter with the halo 
concentration using a numerical simulation. 

The lack of dependence of the scatter on the dynam ical 
state of clusters is also seen bv lO'Hara etall <f2006h for 
observed clusters. They suggested that the scatter in 
the mass-observable relations is not dominated by recent 
mergers, but by the cooling-related core properties or 
probably the overall assembly histories of clusters. The 
latter relationship is indeed found in our simulation. As 
for the effects of radiative cooling, it has been shown 
to be a main source of intrins ic scatter in the Lx — Tx 
relation ([Allen fc Fabianlll998l ). Therefore, our next step 
is to include cooling and feedback mechanisms in the 
simulation to examine their individual influence on the 
scatter in the mass-observable relations. 

Exploring the intrinsic scatter in the cluster scaling 
relations not only provides physical insights into the for- 
mation of galaxy clusters, but also has important im- 
plications for using clusters in cosmology. For example, 
the strong correlation with halo concentrations can be 
used for observed clusters to reduce the scatter in the 
scaling relations. Also, the weak influence of merging 
clusters is good news for cluster cosmology, because it 
implies that when deriving the scaling relations from 
the observed clusters, it is unnecessary to worry much 
about whether to include the unrelaxed systems or not. 
This is good because it is much more difficult to sepa- 
rate out the unrelaxed systems at higher redshifts. We 
expect that detailed studies of the intrinsic scatter in the 
scaling relations, not only in X-rays but also at other 
wavelengths, will continue to yield invaluable informa- 
tion both for cluster physics and cluster cosmology. 
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